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Abstract: 

Background: We study mechanisms underlying the collective emotional behavior of Bloggers by using the agent- 
based modeling and the parameters inferred from the related empirical data. 

Methodology/Principal Findings: A bipartite network of emotional agents and posts evolves through the addition 
of agents and their actions on posts. The emotion state of an agent, quantified by the arousal and the valence, 
fluctuates in time due to events on the connected posts, and in the moments of agent's action it is transferred to 
a selected post. We claim that the indirect communication of the emotion in the model rules, combined with the 
action-delay time and the circadian rhythm extracted from the empirical data, can explain the genesis of emotional 
bursts by users on popular Blogs and similar Web portals. The model also identifies the parameters and how they 
influence the course of the dynamics. 

Conclusions: The collective behavior is here recognized by the emergence of communities on the network and the 
fractal time-series of their emotional comments, powered by the negative emotion (critique). The evolving agents 
communities leave characteristic patterns of the activity in the phase space of the arousal-valence variables, where 
each segment represents a common emotion described in psychology. 
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1 Introduction 



The Internet experience in recent years has revolutionized the mechanisms that an individual can exploit to par- 
ticipate in global social dynamics. Consequently, new techno-social phenomena emerge on the Web |[Tl|2][3]|4l, 
boosting an intensive multidisciplinary research. In technology research, for example, new generation of services 
are developing in the direction to integrate human capabilities in a service-oriented manner |[5|. Behavior of the 
users in the virtual world has impact on real-life events, which becomes a concern of both social sciences and 
every day's practice. On the other hand, the data collected from massive use of the Web provide the basis to study 
human behavior "experimentally" at unprecedented scale. For instance, from the high-resolution data stored at 
various Web portals (social networks, Blogs, forums, chat-rooms, computer games, etc) information related to 
user preferences, patterns of behavior, attitudes, and emotions can be inferred for each individual user and user 
communities gathered around certain popular subjects ll6ll7l[8ll9l [l^[TT1[l^[l3l[T4l[T5l[T6l . 

Physics of complex systems and, in particular, the statistical physics of social dynamics, are focused on the 
dynamical processes in which human collective behaviors emerge from large number of individual actions ||4]|6]|71 
|9l- Combining the concepts of statistical physics with the machine-learning methods for the emotion detection in 
texts of messages ifTSlflTl . we have recently performed analysis of large datasets from bbcblog.com and digg.com 
and determined quantitative measures of the collective behaviors in which the emotions are involved 1741 [TSl . 
Complementary to our work in Refs. lfT4llT5l where the empirical data are analyzed to extract various complex- 
systems properties, the present work is a theoretical study of the processes, underlying the emergence of the 
collective emotional behavior of Blog users, within the framework of agent-based modeling. 

The quantitative analysis of users collective behavior in the empirical data from diggs.com and bbcblog.com in 
Refs. lfT4l[T5l has been enabled by mapping the high-resolution data onto bipartite networks of users and posts, 
as two natural partitions. The idea of bipartite networks makes the "firm ground" also in the present theoretical 
model, where the agents interact indirectly over the posts. We also make use of several other features, observed 
in various empirical data, that are relevant for designing the dynamic rules of the theoretical model: 

• Universality of user's behavior related with the action-delay and the circadian cycles ll9ll6l[T0ll7l: 

• User communities occurring in the cyberspace are reminiscent to the ones in real fife, however, different 
time scales and grouping mechanisms might be involved lfT0l[T4l[T8l[T5l : 

• Quantitative measures of emotions have been introduced in psychology research lfT9l . In particular, based 
on Russell's multidimensional model of affect li20i , each known emotion can be represented by a set of nu- 
merical values in the corresponding multidimensional space. Two fundamental components of emotion, to 
which we refer in this work, are the arousal, related to reactivity to a stimulation, and the valence, measur- 
ing intrinsic attractiveness or aversiveness to a stimulation. These components of emotion can be measured 
in laboratory based on the related psychophysiological and neurological activity 1211 l22l . Moreover, a 
systematic association has been recognized ll23l between individual emotional characteristics and word 
use. The arousal and valence components of an emotion can be retrieved from a written text by suitable 
machine-learning methods, which are being developed for a specific type of data Il22ll24ll25l . 

Systematic analysis of the patterns of user behaviors and the emotion contents in the texts of comments in the 
empirical data from popular Blogs lfT4l . discussion-driven Diggs 1151 . and Forums [1261, suggests that negative 
emotions (critique) drive the activity on these Web portals. However, the mechanisms working behind this global 
picture have not been well understood. In order to elucidate the role of emotions in the blogging interactions, and 
to point out potential parameters and levels where the process can be controlled, we devise an agent-based model. 
The agents are spreading their emotions in a bipartite network environment. The agent's properties, the rules and 
the parameters of the model are closely related with the empirical data from Blogs and Diggs. 

Agent-based modeling Il27ll28l l4l. where different properties of agents influence their actions, provides suitable 
theoretical framework for numerical simulations of social phenomena. Recently a model for product-review with 
the emotional agents in a mean-field environment has been introduced 1291 . with the agents emotional states 
described by two state variables (a, , v, ). These variables correspond to the psychological values of the arousal and 
the valence, respectively, in view of the Russell's two dimensional circumples model ll20l[30l[T9ll . 
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Building on the Ref. ||29ll . here we study an agent-based-model to explore the emergence of user communities 
on Blogs, where the emotional contents are communicated indirectly via comments that they leave on the posts. 
For this purpose, our emotional agents are situated on a weighted bipartite network consisting of users (agents) 
and posts, where by definition no direct hnk between the nodes of the same partition is allowed. The weighted 
links between the nodes of different partitions represent the number of comments of an agent to the linked post. 
Motivated by the realistic situation on Blogs, in our model the network itself evolves over time due to the arrival 
of new users and the addition of new posts, and due to user's actions on previous posts of their preference. The 
emotional state, measured by the arousal and the valence variables, which are attached to each agent-node, is 
influenced by time-varying fields on the posts surrounding that agent on the evolving network. As in real-life, the 
elevated arousal may induce an action of the emotional agent on a post, according to the rules introduced below 
in section |2?2l In the moment of action on a post, the agent's current emotion arousal and valence components are 
transferred to the comment that the agent leaves on that post, where it can be experienced by other agents. Thus 
the fields themselves evolve with the network evolution and differ for each agent, depending on its position on 
the network. In order to have realistic dynamics, we design the rules of actions that are motivated by systematic 
observations of the activity patterns in the empirical data at Blogs and Diggs, as described in section UA\ and in 
the supportive information. Moreover, the set of parameters that control the dynamics of our model are inferred 
from the empirical data of popular discussion-driven Diggs, as explained below. 

2 Materials and Methods 
2.1 Datasets 

In this work we use large dataset^ related to popular posts, from which we (i) study the temporal patterns of 
events, that motivate the dynamic rules of our model, and (ii) extract realistic values of certain control parameters 
of the model. As it will be clear below, the present analysis aims for different features of these empirical datasets, 
yielding several new results in comparison with the ones presented in our previous study 1141 [TSl . 

The datasets that we use are collected from bbcblog.com and diggs.com lfT4llT5l and have high temporal resolution, 
information about identity of each user and of each post, as a unique ID, and precise relationship between users 
and comments-on-posts, as well as full text of all posts and comments. The subsets of data related with popular 
posts as posts with more than 100 comments are selected together with all users linked to them, as good candidates 
for the analysis of collective behavior of users lfT4l [TSl . In addition, diggs.com data contain information about 
comment-on-comment. Thus, we select the subset of popular posts, termed discussion-driven Diggs (ddDiggs), 
on which more than 50% of comments represent reply to the comments of other users. This data consists of Np = 
3984 discussion-driven Digg stories, on which Nq — 917708 comments are written by Nu = 82201 users ifTSl . 
In addition, texts of posts and comments are classified by machine-learning methods with the emotion classifier 
designed in Ref. lfT3l and trained at Blog-type of texts. Using the emotion classifier the texts are designated as 
carrying either positive or negative emotion valence, or otherwise are neutral 1141 . 

From the discussion-driven Diggs dataset here we analyze the temporal patterns of activity related to both users 
and posts. Parts of these patterns are shown in Figs. [T^,b. Each user (post) occurring in that dataset is given 
a unique index, plotted along vertical axis, sorted by the time of first appearance in the dataset. For each user 
index, points along time axis indicate the times when an activity of that user occurred to anyone of the posts. 
Analogously, the points on the posts pattern indicate the times when an activity occurred at that post by anyone 
of the users. In the post-activity pattern, shown in Fig.[TJ), dense points in a narrow time window following the 
post appearance time indicate an intensive activity at that post. This might be related with certain exposure of the 
posts to users during that time period. The width of the exposure time window, Tq, will be recognized as a relevant 
parameter in the dynamics. Whereas, different type of the dynamics beyond the exposure window is manifested 
in systematically reduced activity until eventually the post ceases to be active (expires). 

The situation is entirely different when looked from the point of view of the users. The patterns of activity of 
every user over time is shown in Fig. [TH. The user indexes are ordered by the time of their first appearance in 

^Data accessible on http://www.cyberemotions.eu/data.html under terms and conditions of the CYBEREMOTIONS project. 
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the dataset, hence the top boundary of the plot indicates the appearance of new users, relative to the beginning 
of the dataset. The profile of the top boundary shows that new users arrive in "waves". Moreover, the arrivals of 
new users boost the activity of previous users, which is manifested in the increased density of points in depth of 
the plot below each "wave". This feature of the dynamics is utilized for designing the model rules in section IZ21 
Further quantitative measures of the temporal patterns in Figs.[T^,b are given in relation to the model parameters 
in section IT. 2. 3 1 

The number of newly arrived users (with respect to the beginning of the dataset) in suitably defined time bin 
can be readily extracted from the dataset. The time series of new users pit) per time bin (in units of thin = 5 
minutes), inferred from the ddDiggs dataset is shown by the red line in Figure |2] It exhibits characteristic daily 
cycles superimposed on the fractal fluctuations with long-range correlations. Note that these cycles are related 
with the occurrence of "waves" in the activity patterns in Fig [T^. The signal has the power-spectrum of the type 
5(v) ~ 1/v'^, with « 1.5, shown in top panel in Fig.|2l The time series of the number of all active users per 
time bin, extracted from the same dataset is shown in the same Figure|2]by the green line. It has a similar fractal 
structure and the power-spectrum of 1 / v-type. The power spectrum is correlated over the range of frequencies, 
which correspond to times larger than 2 hours in the time domain. Further analysis of this dataset, which is relevant 
for this work, is given in respect to extracting the control parameters of the dynamics in section [2.2.3l Detailed 
analysis of the emotion contents of the comments and the related time series of the emotional comments can be 
found in Ref. fTSll. In the supporting information. Figure SI shows the time-series of the number of emotional 
comments with positive (negative) valence from the same dataset which is analyzed here, and the excess of the 
negative emotions. 

2.2 Dynamics: Model of Emotional Bloggers on Evolving Bipartite Network 

In the spirit of agent-based modeling, the agents (representing users on Blogs) are given certain properties that 
may affect their actions, i.e., the dynamic rules of the model. Conversely, these agent's properties are changed due 
to dynamic interactions between them, which imply further changed actions, and so on. To describe emotional 
actions of users on Blogs, we adapt the agents whose emotion states are described by two variables — arousal and 
valence, first introduced in Ref. |29). Apart from the emotion variables, the agents in our model have additional 
properties indicated below in Eq. ([1]), and are subjected to the dynamically active networked environment, which 
affects their actions. Moreover, thy are designed for multiple actions in course of the dynamics, thus contributing 
to the emergence of collective emotional behavior. 

As mentioned above, here the emotional agents are adapted to interact indirectly via posts on a bipartite network 
of the agents and posts. Thus the essential elements of our model are: 

• 2-dimensional local maps, describing the emotion variables arousal and valence {«,(?), v;(f)} of each agent; 

• Interaction environment, represented by an evolving bipartite network of agents and posts, through which 
the agents emotion is spreading; 

• Driving noise, applied to systematically perturb the system boosting its internal dynamics. In our model the 
system is driven by adding new users, according to the time-series p{t) of new users, which is inferred from 
the empirical data of ddDiggs, as explained above. No other type of driving is considered in this work. 

It should be stressed that the bipartite network type, with the users (agents) and the posts as two partitions and the 
weighted links between them representing the number of comments of the user to the post, is necessary in order 
to take into account the fundamental nature of the dynamics on Blogs: users, as the nodes of the same partition, 
are never connected directly, but only through the posts. Technically, both types of nodes appear as the objects at 
the same level. Therefore, we actually have two types of "agents", user and post agent, with different properties 
indicated here: 

U[g;vi{t),ai{t);ListSi...,At] ; P[tp;< v,,(f) >,<«,,(?) >;ListSp...] . (1) 

The dynamical variables arousal and valence a, (f ) and v, (f ) are properties of the user nodes, which vary in time as 
explained in detail below. In analogy to real Blogs, at the moments of actions the emotion (arousal and valence) 
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of the user are transferred to the post (precisely to the comment that the user puts on the post), and thus contribute 
to the current overall emotional content on that post, < Vp{t) >,< fl;,(f) >. Both user and post objects have 
individual Lists of connections on the evolving bipartite network, which are updated through the user actions, 
as explained below in section 12.2.21 Additional properties that may affect the dynamics in our model are the 
post life-time, tp, and the user action-delay. At, as well as the user probability for posting a new post, g. These 
properties can be inferred from the considered dataset, as done below in section l2".2.3l 

Beside the dynamic rules, which are motivated by the blogging processes, other gross features of our model 
are different from previous work on the emotional agents in Ref. [29|. The relevant differences are due to the 
role of networked environment and the endogenous driving, without any external noise. Moreover, at the level 
of individual agents, apart form the two emotion variables which are given by the same type of the nonlinear 
maps Eqs. (IllO as in Ref. 1291 , the agents in our model have additional properties which affect their actions. 
These are the inclination towards posting new posts and the action delay, measured by the quantities g and Af, 
respectively, as well as the lists of connections (posts) on the network, which are unique for each agent. Hence the 
environmental fields in Eqs. (|2][3l) are practically different for each agent on the network. In this way the network 
environment induces (and keeps track of) the heterogeneity among the agents in a natural way, through the lists 
of posts to which they were connected in the course of their actions. Having these general remarks, we introduce 
details of the model in the remaining part of this section. We first explain the dynamic rules of the model and 
define all the parameters that control the dynamics. 



2.2.1 Emotional states of individual agents 

Following Ref. |f29l, we assume that the individual emotional state (arousal and valence) of each agent can be 
described by two nonlinear equations, which are subject of the environmental fields. For our system on bipartite 
networks, the arousal and the valence are associated with each user-node(!) and their values, kept in the intervals 
a,(f) € [0, 1] and v',(f) G [—1, 1], are updated according to the following nonlinear maps: 

a.(t + i) = l (i-7«)«<(0 + [/'J'(0+?/C/(0](«'i+«'2(«,-(0-«<(0'))(i-«<(0) ifAf,'<i 

' \ (l-7fl)a,(f) otherwise 

and 

/ (l-rv)v,(0 + [/'r(0+?/V0]W(ci+C2(v,(f)-v,-(f)^))(l-|v;|) ifAf,-<l 

v,ir + ij-| otherwise ^ ^ 

where ; = 1,2- • -Nuit) indicates the index of user node and t — the time bin. The coefficients di,d2 and c\,C2 
characterize the maps themselves, while the network environment effects appear through two types of fileds: the 
local fields h"{t) and h){t), and the mean fields h^„f[t) and h''^f{t). Note that the local fields h"{t) and /2)'(f) vary 
not only in time but also from user to user, depending on their connections on the network, and due to evolution 
of the network itself (see details below). Whereas, the mean fields /i",^ (?) and h^„jf{t) may act on a larger number 
of users, while also fluctuating in time. In our model they steam from a currently active posts and, thus, may be 
seen by all users who are attached to these posts. The mean fields indicate how the overall activity moves through 
posts, as a kind of "atmosphere" at the Blogsite. The contribution from the mean fields in our model is taken with 
a fraction < ^ < 1, which is varied as a free parameter in Eqs. (|2| and (O, and it is added to the contributions 
from the local fields. A user-node receives the stochastic inputs from the network in certain instants of time, when 
the events occur in its network surrounding, and reacts to them with a delay. The delay time Af of each user is 
counted continuously. When the delay time is smaller than the computational time bin Af < 1 ti,i„, the user is 
prompted for the update of its arousal and valence according to the full expressions indicated by top lines in the 
Eqs. Q and (O. Otherwise, the arousal and valence values are only relaxing, with the rates y" = Y' = y. 

For understanding the dynamics at local level, let us consider for a moment the Eqs. (|2]) and (O as two-dimensional 
nonlinear maps of an isolated node subjected to a given constant field. In Fig. [3] we show the situation for several 
values of the field pre-factors, while keeping all other parameters fixed to the values which are used later in the 
simulations. Depending on the parameters, the maps can reach different fixed points. In particular, the arousal 
map always leads to an attractive fixed point, the position of which depends on the strength of the field — larger 
arousal is reached when the field is stronger, cf. Fig.jS^. In the case of valence, two fixed points can be reached, 
one at the positive valence, when the fields are positive (upper branch), and the other one in the area of negative 
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valence, which is attractive when the fields are negative (lower branches in the Figure |3j)). In general, when 
the nonlinear maps are coupled on a network, the network environment affects each individual map through a 
feedback loop, causing synchronization ll3n[32l or other self-organization effects [l33i among the nodes. In our 
case the network affects the fields, which thus fluctuate at every time step and depending on the node's particular 
position on that network. The dynamics of the fields can thus be visualized in the local map as jumping of the 
trajectories v; (f ) , a, (f ) from one branch of the map to another branch, and consequently, being attracted to another 
area of the phase space. As will be explained in detail below in section 12.2.21 the nonlinear mapping takes part 
only when the agent is prompted to act. Meanwhile, the maps are only relaxing towards the origin. 



2.2.2 Agents interaction and network evolution: Rules & Implementation 

The fields h"{t) and h"„f{t) in Eq. (|2]i, which affect user / arousal at step f + 1, are determined from the posts in 
the currently active part of the network, 'io{t,t~ 1), along the links of that user. Specifically, 

, _ Ipe^(r/-l)A,>af (0(1 +v,(Ovf (0) . W(^r-i)Q^(0 
'■^^ W(,,-l)A,>«J(0(l+v,(Ovf(0)' "^^^ ' ^ ^ 

where aj(f ) and v^(t) are the total arousal and the average valence of the post p calculated from the comments 
in two preceding time steps, while nj(f ) is the number of all comments posted on it during that time period. A,p 
represents the matrix elements of the network, i.e., A,p > if user / is connected with the active post p, while 
Aip = if there is no link between them at the time when the fields are computed. Note that such links may appear 
later as the system evolves! In Eq. ^ the individual arousal fields h"{t) is modified by (dis)similarity in user's 
actual valence, v,(f), and the valence of recent comments on the post, v^{t). 

Regarding the valence fields in Eq. (O, we take into account contributions from the positive and the negative 
comments separately, while the neutral comments do not contribute to valence field. Depending on the current 
emotional state of the agent, positive and negative fields can lead to different effects 1291 . in particular, positive 
(negative) state will be influenced more with negative (positive) field, and vice versa. Here we assume that both 
components influence user valence, but with different strength according to the following expression; 

^ l-Q.4r,-(f) i:peV(t,,-\)AipN+{t) l+0.4r;(0 Ipe^(r,r-1)^.>A^,7 (0 ^ 

''^^ 1-4 W(M-i)A,pA^7"(r) 1.4 Ipe^(a-i)A,>A^7"(0 ' 

where the valence polarity of the user / is given by r,(f) = and Np{t) is the number of positive/negative 

comments written on post p in the period [f — l,f]. The normalization factor N'j^°{t) is defined as Np'""{t) = 
Np{t) +Np{t). The mean-field contributions to the valence steam from the entire set of currently active posts 
'^^{t,t — 1), and are independent on how users are linked to them: 

^„ l-0.4r,-(f) LpeV{r.r-l)N+it) l+0.4r,-(f) W(r,-l)A^p(0 



However, the mean-field effects are perceived individually by each user, depending on the polarity /•;(?) of user's 
current valence. 

The rules of agents interactions on the network are formulated in view of user behavior on real Blogs and 
Diggs and the observations from the quantitative analysis of the related empirical data. In particular, the dynamic 
rules are motivated by the temporal patterns in Figs.[T^,b and the time-series in Fig.|2] indicating how the number 
of active users arises in response to the arrival of new users. Moreover, additional features of the dynamics on 
ddDiggs, shown in supporting information. Figure S 1 and Movie S2, suggest the dynamics with the dominance of 
negative emotions and with user's focus systematically shifting towards different posts. In the implementation of 
the model rules, we also make use of some general features of human dynamics, i.e., the occurrence of circadian 
cycles and delayed action to the events, mentioned in the Introduction, and assume that the arousal drives an 
action, as commonly accepted in the psychological literature. 

The rules are implemented in the C++ code as follows. The system is initialized with typically 10 Users who are 
connected to 10 Posts, to start the lists of the exposed and the active posts and the prompted and the active users. 
Then at each time step: 
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• The system is driven by adding p{t) new users (Note the correspondence of one simulation step with one 
ffe,-,, = 5 min of real time); Their arousal and valence are given as uniform random values from a,- e [0, 1], 
V, S [— 1, + 1], then updated with the actual mean-field terms. By the first appearance each user is given a 
probability g £ P{g) to start a new post. The new users are then moved to the active user list; 

• The emotional states for all present users are relaxed with the rate y, according to the second row in the 
Eqs. Q and (O; 

• The network area "^(f ,f — 1) of the active posts is identified as post on which an activity occurred in two 
preceding time steps; then the lists of active users is updated from the users linked to these posts, as follows: 

- Users linked to the active posts are considered as exposed to the posted material and decide when they 
will act on it, i.e., they are given new delay-time from the distribution P(Af); All users whose current 
delay time At < If^m are prompted for update the emotional states according to the first rows in Eqs. 
^ and ([3]), with their actual network fields computed from the Eqs. (|4l|6]l. An updated user is moved 
to the active user list with the probability aQai{t) proportional to its current arousal, else it gets a new 
delay time Af G P(Af); 

• Every active user: 

- adds a new post with the probability g or otherwise comment to one of the exposed posts, which 
are not older than Tq steps; Users are linked to posts preferentially with the probability Pp{t) = 

0.5{l+vf(t)vi{t))+Nf,(t) , ,■ , , J- ■ \ 1 1 1 • -1 • 

^ 'f ' l N I \.,r, M . dependmg on the number of comments on it A'„(n and the valence similarity; 

i:,,[o.5(i+v|(0v,(?))+A'^(O] i- c pw J 

- and with probability yi comments a post which is older than Tq steps. The post is selected prefer- 
entially according to the negativity of the charge of all comments on it, with (properly normalized) 
probabilities Pj^oid{t) ~ 0.5 + if the charge is negative, else Pj^oid{t) ~ 0.5; Lifetimes of the 
posts are systematically monitored (already expired posts are not considered); 

- Current values of the valence and arousal of the user are transferred to the posted comment or the new 
post; User is given a new delay-time At € P{At); New posts are given life-time tp E P{tp)- 

• Delay-time At for all other users is decreased by one. Time-step closes with updating the lists of the exposed 
and the active posts, and the lists of the exposed and the prompted users. 

2.2.3 Control parameters: Definitions and inference from tlie empirical data 

According to the above dynamic rules of the model, one can identify the parameters which control the dynamics 
at different levels. In particular, we use the following parameters, distributions or time-series which characterize, 
respectively: 

• the local maps: c\ = d\ = 1, ci = 2.0, d2 = 0.5, 7= 0.05; 

• the properties of posts and users: tp G P{tp), At e P{At), Tq = Idays, IJ.{Tq) ~ 0.05, g G P{g); 

• the driving: {p{t)}, q = 0.4, = 0.5. 

As stated above, the time-series of the number of new users per time bin, {/?(?)}, is shown in Fig.|2] Several other 
parameters and distributions can be also inferred from the high-resolution data of Blogs and Diggs. Specifically, 
in Fig. m we show the distributions of the lifetime of posts, P{tp), time delay of users actions, P{At), and the 
fraction of new posts per user, P{g), as well as the functional dependence of the probability /i(7b) that a user 
looks for a post older than the exposed posts window Tq. These quantities are inferred from the ddDiggs dataset. 
The value Tq = 576 time bins (corresponding to 2 days of real time) can be approximately estimated from the 
posts activity pattern, cf. Fig.[TJ). The numerical values of the remaining parameters can not be extracted from this 
kind of empirical data. Hence they are considered as free parameters, that can be varied within theoretical limits. 
The values quoted above are used for the simulations in this work. 
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Here we describe a general methodology how such parameters are determined from the empirical data of high 
resolution. The delay-time distribution P{At) in Fig.|4]:, is directly related with the user activity pattern, cf. Fig. 
[T^: for a given user (fixed index along y-axis) the delay time At is defined as the distance between two subsequent 
points along the time axis. The distribution is then averaged over all users in the dataset. Similarly, the distribution 
of the life-time of posts P{tp) in Fig.' [Ji, is related to the pattern of posts activity as the distance between the 
first and the last point on the time axis for a given post, cf. Fig. [T]5. The parameter Tq is roughly estimated as 
the width of the time window, during which new posts were 'exposed' (dense points area in Fig.lTJ)). When Tq 
is fixed, then the probability that a user finds a post which is older than 7b can be extracted from the data as the 
fraction of points beyond the dense area in the posts activity pattern until the post expires, cf. Fig.[TJ). Then we 

have /i(7b) = ]^Lp=i (r"I^r[' >to +To l)' where Np is the number of posts, tp is the expire time of the post p, 
while tkp and fop indicate the moments of the activity at the post p and its creation time, respectively. Averaged 
over all posts in the dataset, gives the parameter jLi(7b), plotted in Fig.|4j5 against Tq. In the case of user properties, 
looking at the activity list of a given user, we can determine the fraction g of new posts that the user posted out 
of all posts on which the user were active in the entire dataset. The values appear to vary over time and users, the 
distribution P{g) averaged over time and all users in the dataset is shown in Fig.|4^. Strictly speaking, the values 
of the control parameters will depend on the empirical dataset considered. Specifically, the parameters as the life- 
time of posts, tp, and users inclination to posting new posts, g, or to looking towards old material /i(7b), strongly 
depend on the dataset. Note also that they might have hidden inter-dependences in view of the nonlinear process 
underlying the original dataset. For instance, if on a certain Blogsite users are more inclined towards posting new 
material, which would yield increased probabilities of large g, then the life-time of posts may decline, resulting 
in a steeper distribution. Therefore, it is important to derive these parameters from the same dataset in order to 
ensure their mutual consistency. Note, however, that certain universal features apply, in particular, in the power- 
law dependences of the delay-time |l9][6l distributions P{At), and the circadian cycles Q in the time series {p{t)}. 
Although our model works for a wide range of parameter values, here we keep the parameters extracted from the 
empirical data of the popular discussion-driven Diggs in order to enable a comparison of the results to largest 
possible extent. In contrast, the relaxation rate of the arousal and the valence and the parameters di,d2,ci,C2 of 
the maps in Eqs. (|2]l3]l can not be extracted from this type of empirical data. The values shown above are chosen 
such that the fixed points of the maps do not fall to corner areas for typical values of the environmental fields 
occurring in our simulations, cf. Figs.[3^,b. 



3 Results 

3.1 Time-series of the emotional comments 

As mentioned above, we drive the system by adding p{t) new users at each time step and letting them to boost 
the activity of the system, according to the model rules introduced in sec 12.2.21 We sample different quantities 
in analogy to those that we can define and compute from the empirical data, see section IZTI and Refs. lfT4l [Tsl . 
Compared with the empirical data, the advantage of the agent-based model is that we can keep track of the 
fluctuations in the valence and the arousal of each agent ("user") at all time steps. 

Typically, the arousal and the valence of an agent, who is linked through the posts to other agents, experiences 
stochastic inputs from the active environment, as shown in Fig.|5] Between such events the emotion arousal and 
valence decay with the rate y towards zero values. It should be stressed that at each agent (user-node) different 
patterns of the activity are expected. They depend not only on the current network structure surrounding the agent, 
but also on the fact that at given time the activity might be transferred to another part of the network, i.e., due to 
the aging of posts and the preferences of other agents towards particular types of posts. Two illustrative examples 
shown in Fig. |5] are from the same simulation run, but for two agents who are located at different areas of the 
network. 

The actions of individual agents contribute to the overall activity that can be monitored at each post and at the 
whole (evolving) network, as well as at the network parts, for instance the topological communities, that can be 
identified when the network is large enough. In the simulations we monitor the fluctuations of the number of active 
posts, Nap{t), the number of different agents that are active at these posts, Nau{t), and the number of comments 
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that these agents posted at each time step, Nc{t). Furthermore, we distinguish between the comments that carry 
positive (negative) valence, N±{t), and the overall charge of these emotional comments, which is defined as the 
difference between the number of comments with positive and negative valence, Q{t) = N+{t) — N-{t). The 
temporal fluctuations of these quantities are shown in Fig. |6t,b, where only the initial part of the time-series are 
shown, corresponding to four weeks of real time. Notice, the circadian cycles of the driving signal are reflected to 
the time-series of the number of active agents and the number of their comments. 

The power spectra ^(v) of these time series are shown in the upper panels in Figs.|6]:,d. A characteristic peak cor- 
responding to the daily cycles of the time-series is visible. In addition, long-range correlations with S{v) ^ 1 /v'l' 
occur in most of these time series (except for the charge fluctuations!) for the range of frequencies, indicated 
by the slopes of the straight lines in both Figures. The simulated time-series can be compared with the ones 
observed in the empirical data, for instance Fig.|2]and Figure SI and with similar data analyzed in Refs. lfT4llT5l . 
The fractality of these time series, leading to the power spectrum of the type l/v'^, as well as the dominance of 
the negative charge suggest that our model captures the basic features of the blogging dynamics. Specifically, 
in response to the same driving signal, which has the power spectrum with the exponent = 1.5, the simulated 
blogging process builds the long-range correlations yielding the time-series with smaller exponents = 1.33 and 
(j) — I, in the number of active agents and the number of comments, respectively, and increased range of corre- 
lations, qualitatively similar to the popular Diggs. Further comparison between the simulated and the empirical 
data can be considered at the level of the emergent network topology, studied below in section [321 These time 
series have comprised the agent's activity at the whole (evolving) network. In sec.|4]we analyze the time series 
which are recorded at the level of each emerging agent-community separately. 

3.2 Emergent network and communities of the emotional agents 

The networks that emerge through the activity of our emotional agents on posts can be studied in full analogy with 
the bipartite networks mapped from the empirical data of the same structure, which are discussed in section IZTI 
and in Refs. ifTOl fT4l [341 . (For other interesting examples and review of complex monopartite networks, see 1351 
and references therein). In our simulations the network evolves due to the addition of nodes of both partitions, 
as well as the evolution of links. The lists of user — posts connections are updated at every time step. With 
the prevailing negative comments, as demonstrated above in the time-series, cf. Fig. [6] arrivals of the negative 
comments at posts generate an environment that, in view of the linking rules (large negative charge preference), 
strongly affects the network evolution (see also discussion in sec.[4li. In this way some posts with a large number 
of negative comments and thus large topological strength may appear. A part of the emergent network obtained 
in our simulations is shown in Fig. [7] where three such hubs — popular posts are visible together with the users 
linked to them. 

Some topology measures of the emergent bipartite network — the degree distributions and the assortativity mea- 
sures, are shown in Fig. [??k.b. For comparison, the corresponding topology measures of the network obtained 
from the empirical dataset of discussion-driven popular Diggs are also computed and shown in supporting infor- 
mation. Fig. S3. As expected, the degree distributions for each of the partitions — agent(user)-nodes and post- 
nodes appear to be different (some other examples of bipartite networks representing the empirical data of various 
techno-social interactions have been analyzed in |[34l ). Specifically, the broad distributions are dominated by 
different type of cut-offs. They can be approximated by the following mathematical expressions, motivated by 
fitting the corresponding empirical data, cf. Fig. S3: 



for the agent(user)-node and the post-node distributions, respectively. The agent-degree distribution P{qu) ex- 
hibits a short power-law region and a large exponential cut-off, whereas the distribution related to the post-nodes 
P{qp) has a dominant cut-off at smaller degree followed by a power-law tail, compatible with the ^'-exponential 
form with ^ > 1 . The fitted values of the parameters are shown in the Figure legends in Fig.[??b. In principle, they 
depend on the simulation parameters of the agent-based model. However, the expressions appear to be stable with 
respect to the simulation time (size of the network). The results are shown for two simulation runs with 16384 
and 25000 time steps, resulting in the networks with A^^ = 13504 +A^t7 = 64852, sndNp = 22151 +Nu = 107933 





(7) 
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nodes, respectively. The same mathematical expressions in eq. O apply (with different parameters) to the cor- 
responding distributions derived from the empirical data, cf. Fig. S3a. Hence, in these measures the topology of 
the bipartite network emerging in the emotional blogging of our model shares qualitative similarity with the one 
from the popular posts in real data. To certain extent, similar conclusions apply in the case of the mixing patterns, 
shown in Fig.[??b and in Fig. S3b. Namely, the posts making the network neighborhood of an agent-node (user- 
node in the empirical data) exhibit no assortativity measure, which is indicated by the line of zero slope before 
a cut-off at large user-node degree. In the case of post-nodes, the empirical data indicate slight disassortativity 
(decrease) just before the cut-off. Fig. S3b, a feature that seems not be properly captured by our model in the 
present parameter range. Systematic study of the network topologies, emerging when the parameters of blogging 
dynamics within the agent-based model are varied in a wide range away from their empirical values, is left for a 
separate work [361 . Here we focus on the mesoscopic structure of the emergent network obtained for the current 
set of parameters, which are listed above. In particular, we are interested in communities of the emotional agents, 
that may potentially occur on the respective monopartite projection of the network. 

Communities, as topological subgraphs with stronger connections between the nodes inside the community com- 
pared to the rest of the network, can be accurately identified using different methods ||37l[38l[39ll40| . The bipartite 
networks representing the dynamics on Blogs and Diggs exhibit abundance of different communities lfT0l[T4l[T5l . 
A systematic analysis of such bipartite networks, representing various online interactions, can be found in 1341 . 
The monopartite user- or post-projections of these networks appear to be highly clustered and weighted networks 
lfT0l[T4l[T5l[34ll . which limits applications of classical methods of community analysis BOl . Therefore, we use 
the methods based on the eigenvalues spectral analysis of the network HTl [39l and the maximum likelihood 
method adapted for multi-graphs ||42l . The networks emerging through the actions of the emotional agents in our 
simulations have similar features. 

We perform the spectral analysis of the normalized Laplacian operator B3l [39l which is related to the weighted 
user-projection network, whose matrix elements represent the common number of posts per pair of users, 
including the multiplicity of user-post connections, which is indicated by the superscript. It is constructed from 
the symmetric matrix of commons as 



where /,■ is the strength of node defined as the sum of wights of its links. As discussed in detail in Refs. 
the spectrum of the Laplacian ^ is limited in the range A, G [0,2]. When the communities exists, the lowest 
non-zero eigenvalues of the Laplacian ([8]) appear separated from the rest of the spectrum and the corresponding 
eigenvectors are localized on the network subgraphs (communities). The localization of the eigenvectors is visu- 
alized as a characteristic branched structure of the scatter-plot in the space of these eigenvectors. This property 
of the eigenvectors is then utilized to identify the nodes of the network that belong to each community (detailed 
discussion and various examples studied by this methods can be found in ll39ll44l[T0l[T5l ). It should be stressed 
that the network grown through the emotional actions of of the agents in our model has specific properties which 
may be reflected in the community structure. Namely, the bipartite network is already weighted, which shifts the 
distribution of weights in the monopartite projection away from pure topology of commons |f34'|. In addition, the 
network evolves in such way that the center of the activity is shifting to ever new groups of (exposed) posts. 

Here we analyze the structure of the network after 4032 time steps (two weeks) of the evolution. The network 
projected onto user (agents) partition contains Nu = 4572 users, only users with the degree larger than 5 are 
considered as relevant for the community formation. The eigenvalue spectrum of the Laplacian operator Eq. [8] 
with the Cj^ matrix related with this user-projected weighted network, is then computed. The results for the 
eigenvalues shown in the ranking order are given in Fig. [8^. The scatter-plot of three eigenvectors belonging to 
the three lowest eigenvalues is shown in Fig. [8]5. The spectrum, as well as the scatter-plot in Fig. [HJi, indicate 
that five agent-communities can be differentiated. These are denoted by G^, with k = 1,2, •• - 5 corresponding to 
top-to-bottom branches in Fig. [Sh. In the following we first identify the nodes representing the agents in each 
of these communities. Then we analyze how the communities actually evolved on that network and discuss the 
fluctuations of the emotional states of each agent in the communities through the evolution time. 

The communities on user-projected networks are our main concern, in view of the collective behavior of the 
emotional agents. Formally, the same methodology can be applied to the post-projected network as well as the 



10 



weighted bipartite network directly. Studies of the empirical data IfTOl [T4l suggest that the communities of posts 
often appear in relation to their subjects, age, and sometimes authorship. The features is prominent in the case 
of Blogs of "normal" popularity. Whereas, in the communities on popular posts the subjects are often mixed, 
leaving potentially different driving force for user's intensive activity on that posts. In our model the post have no 
defined subjects, and the agents are driven by the emotion content alone. Nevertheless, the results of the spectral 
analysis of the post-projected network reveals that several communities of the posts can be identified. The network 
projection is described in supporting information and the scatter-plot of the respective eigenvectors is shown in 
Fig. S4. Four communities can be differentiated. Looking at the node's identity in these branches, we find that 
the "age" of the posts prevails as a grouping principle. Other attributes of posts that we have in the model, as 
"authorship", "popularity" and "charge", seem to be mixed in all present communities. 



4 Discussion 

4.1 Patterns of agents behavior in time and space of the emotion variables 

Having identified the agents in each of the communities, we can track of their group activity and the emotion 
fluctuations over time from our simulation data. The time-series of the number of comments of all agents in a 
given community G/^ are shown in Fig.|9^ and the emotional charge of the valence of these comments — in the Fig. 
I9J). Note that a fraction of comments with the valence values close to zero in the range between (—0.01, +0.01) 
are considered as neutral, and do not contribute to the charge. The profile of the time series indicates that all 
communities started to grow at early stages of the network evolution. However, two of them, Gi and G5, ceased 
to grow and reduced the activity relatively quickly after their appearance. Looking at the fluctuations of charge 
of the emotional comments in these two communities, we find that it is well balanced, fluctuating around zero at 
early times, and eventually leveling up to zero. Whereas, in the other two medium-size communities, G2,G4, the 
activity is slowly decreasing, while the largest central community, G3, shows constantly large activity. Comparing 
the activity (number of comments) with the fluctuations in the charge of the emotional comments, we can see that 
in these three communities the excess negative charge settles after some time, breaking the initial balance in the 
charge fluctuations. In this way our model reveals the correlations between the prolonged activity and the size 
of a community (i.e., number of different agents), on one side, with the occurrence of the negative charge of the 
related comments, on the other, a feature also observed in the empirical data on Blogs and Diggs lfT4l[T5l . 

In view of the preference towards the posts with negative charge, a comment regarding the breaking of the charge 
balance and its consequences to the network topology is in order. Note that, according to the model rules, cf. 
sec. 12.21 the probability of a post to receive first negative/positive comment depends on the valence of the agent 
who is active on that post and its similarity with the average valence of the currently active posts. Contrary to 
the naive preference towards node's degree, which is known to lead to a scale-free degree distribution in growing 
monopartite networks (for theoretical derivation and the conditions when power-law distributions occur, see Ref. 
II45I ). in our model agents preference is driven by another quality of a post node — its emotional content. Hence, in 
this process no scale-free distributions of the posts degree is expected, as also shown above. More importantly, the 
negative charge appears to have limited fluctuations. The time-series of the (negative) charge and of the number 
of comments remains stationary over large periods of time, before the activity ceases, as shown in Figs.|9]for the 
communities, and in Fig. |6]3 for the entire network. 

Another interesting feature of these communities can be observed by visualizing the patterns of their activity in 
the phase space of the emotion variables. In order to match the emotion measures accepted in the psychology 
literature, i.e., according to the 2-dimensional Russell's model 1201 [191 , we use the circumplex map suggested 
in Ref. Il46l . The values of the arousal and the valence variables are thus mapped onto a surface enclosed by a 
circle, where, according to Refs. 1191 l20l . the emotions commonly known, as for instance, "afraid", "astonished", 
"bored", "depressed", etc., can be represented by different points (or segments) of the surface. In particular, the 
values of the arousal and valence are mapped as follows 1461 : 
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where z = m/«(|fl' |, |v|) and a' e [—1, +1] is obtained from our arousal variable by first mapping a' = 2fl — 1. 

Computing the transformed values of the arousal and the valence for each agent in a given community at all time 
steps when an action of that agent is recorded in our simulations, we obtain the color-plots shown in Figs. [TO] 
Specifically, the color map indicates how often a particular state on the circumplex was occupied in four of the 
above communities, normalized with the all actions in that community. As the Figure [TO] shows, the communities 
leave different patterns in the space of emotions. For instance, the community G\, that have balanced charge 
fluctuations, appears to cover a larger variety of the emotional states, leading to the pattern on the top left figure. 
Whereas, when a large community is formed, it may induce large negative fields which keep the agents in the 
negative valence area of the circumplex map. The situation corresponding to the community G3 is shown in 
bottom left plot in Fig. (TO] Majority of the comments is this case are centered in the area of the arousal and 
the valence where the negative emotional states known as "worried", "apathetic", and "suspicious", "impatient", 
"annoyed" etc, are found on the circumplex map (see Refs. |fT9ll46l for coordinates of some other well known 
emotional states covered by these patterns). Plots on the right-hand side of Fi g . [TOl corre spond to the communities 
G2 and G4, in which charge fluctuations are moderately negative, as discussed above. 

From the Figures [TO] we can also observe that the well defined lower bound for the agent's arousal emerges in a 
self-organized manner inside each community, although no sharp threshold exists in the model rules. Moreover, 
the arousal drives the valence when the agents are active. This is clearly displayed in the case of communities 
with a balanced charge, as our community G\, Fig.[TO]top left. Similar pattern of the arousal-valence was found 
in the laboratory experiments 1471 , where the values of the arousal and valence are inferred from skin conduction, 
heart beat and facial expression measurements on users reading a selection of posted texts. 

The characteristic patterns in Figs.[TO]emanating from the emotional blogging of our agents suggest the processes 
with anomalous diffusion, in which certain parts of the phase space are more often visited than the others. They 
reflect the self-organized dynamics of the agent's emotion variables and the network topology. Formally, these 
patterns are between two extreme situations; synchronous behavior, focusing at a lower-dimensional areas, and 
random diffusion, spreading evenly over the entire space. The most visited areas of the phase space are in the 
vicinity of the attractors of the nonlinear maps. The positions of these attractors for each agent map move, 
depending on the actual values of the fields acting on it. The fields themselves fluctuate over time for each map, 
being tunned by the the agent's emotion variables and the local topology of the network (community) where the 
agent is situated. 

4.2 Conclusions 

In this work we have adapted the idea of the emotional agents with two-dimensional emotion states following 
Ref. ||29l , and implemented them onto a networked environment with a bipartite network of agents (users) and 
posts, to model the dynamics on Blogs and similar Web portals, where the emotions are communicated indirectly 
via posts. In addition to the emotions, measured by the dynamical variables of the arousal and the valence of 
each agent, appearance of our agents on the network follows circadian cycles and and the action-delay from a 
distribution, which is, like some other parameters of the dynamics, inferred from the empirical data. 

Novel and the most important features of our agent-based model are: 

• the agent's emotion is spreading via indirect contacts through their actions on the posts on a bipartite 
network, and 

• the network itself evolves through the addition of the agents and their actions on the posts. 

• We also present a systematic methodology to extract the relevant parameters of the model from the empirical 
datasets of high temporal resolution. 

Apart from the inference of the control parameters from a given empirical dataset, the structure of the model allows 
experimenting on the dynamics and potential extensions at three different levels; (a) changing the properties 
of each individual agent; (b) modifying the linking rules, i.e., posts exposure, agent's preferences, role of the 
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emotions, etc.; (c) introducing different and/or additional driving of the system, and varying the balance between 
the influences of the local and the global events. 

In this work the focus was on the following aspects of the dynamics: 

• the emergence of the collective states due to agent's emotional communications; 

• the patterns of their emotional behaviors in time and in phase space of emotion variables; 

• the role of underlying contacts and the structure of emergent network. 

For this purpose, in the simulations we consider no external input to drive the system except for the agents arrivals 
according to the time signal {p{t)}, inferred from the empirical data of the discussion-driven popular Diggs. We 
have demonstrated that the communities of agents emerge through the emotional commenting of posts, and can 
be identified as topological subgraphs on the weighted user-projected network. Most of the activity of the (user) 
agent occurs inside the community where the agent belongs. Moreover, the growth of the community is self- 
amplified and prolonged with the excess negative charge of the related comments. Another quantitative measure 
of the collective behavior is found in building the correlations in the streams of events — 1 /v'^'-type of the power 
spectrum for the time series is found for the emotional comments, in the response to the driving signal, which has 
weaker correlations. Moreover, this also applies for the time-series of comments with positive/negative emotion 
valence. These features are in qualitative agreement with those found in the empirical data from the discussion- 
driven Diggs and Blogs, analysed by the same mathematical approaches 1341 [TSll . cf. also Fig. |2] Hence we 
can conclude that the dynamic rules of our agent-based model reveal the key mechanisms behind the collective 
emotional behaviors, which are observed on the popular posts in real Blogs and Diggs. 

To examine the role of the driving signal in building the collective response, we perform comparative simulations 
in which the features of the p{t) signal are completely ignored. Instead, we drive the system by adding a constant 
number p = 6 of agents per time step. The simulated time-series are shown in supporting information. Fig. S5a 
and its power-spectra in Fig. S5b. Apart from higher average activity and the absence of daily cycles, the fractality 
of the time-series is preserved. The power-spectrum with the exponent ^ = 1 .25 is observed for the number of 
comments, although in a smaller range. The negative charge sets-in after certain time period and fluctuates in a 
stationary manner These results suggest that the occurrence of long-range temporal correlations is inherent to the 
stochastic process of our model, which might be enhanced, but not imported, by the profile of the driving signal. 
It is also interesting to point that the network topology grown in this process exhibits the degree distributions of 
the same type as Eq. (|7]i and three communities of the emotional agents, as shown in Fig. S5c and d. 

Compared with the empirical data, where the emotion is extracted only from the text on posts, in the agent-based 
model we can follow the fluctuations of the emotion of each agent over time. In this respect our model can 
interpolate between the psychology experiments at individual users, on one side, and the global emotional states, 
that can be recognized at larger scale [8], on the other. We have shown that the activities within each community 
of the agents leave a characteristic pattern in the space of emotions. Specifically, "normal" blogging leads to 
balanced emotions in a segment of the circumplex map, driven by large arousal (i.e., between the lines marking the 
limits of "high power/control" and "obstructive" emotions in Ref. 1191 ). This is in agreement with the laboratory 
measurements of the emotional states obtained on individual users pTllTSl . However, their collective behaviors 
may be entirely different. Our simulations suggests how the large communities can get caught into excessive 
negative emotions (critique), which prolongs the activity and the number of agents involved. The model provides 
the underlying mechanisms and the parameters through which such collective behavior may be influenced. 

In a broader view of the science of affective computing, our work aims to quantitative accounting of the dynamics 
of emotions. With the rules and parameters motivated by the discussions on popular Diggs, our agent-based 
model describes the dynamic environment with indirect communications among users, where the critique prevails 
over positive emotions. Keeping track of the relations between simulated events in terms of an evolving bipartite 
network is another aspect of this work, which makes the basis for the quantitative analysis of agents collective 
behaviors. Furthermore, the network with its local structure is essential for the discussion type dynamics that 
we study. By taking away the network of posts, the agents would be literally disconnected and their dynamics 
greatly simplified, consisting of a single appearance-and-relaxation event per agent. The role of the network 
structure can be altered by increasing the mean-field term (i.e., its strength q and the selection of posts which 
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contribute to it), or by adding an external noise to Eqs. (I2]|3]l which acts directly to each agent, as in Ref. ||29l . 
Qualitative agreement of the simulation results with the ones of the empirical dataset indicates how the emotional 
communications prevail on popular posts, leading to the bursting events and the emergence of communities. 
Whereas, the quantitative differences, in particular in the user heterogeneity shown in Fig. S3 compared to the 
the degree distribution of our emotional agents in Fig.[??] suggest to what extent features other than emotion may 
drive users behavior Potentially different outcomes can be predicted by the model when parameters are changed 
or extracted from another dataset. On the other hand, when the posts of "normal" popularity are considered, we 
expect that subjects of the posts may play a role, affecting the way that certain emotions are communicated. Such 
situations can be studied by appropriate modifications of the dynamic rules within our model and inference of the 
related parameters. 

In conclusion, compared to extracting user's emotional behavior from texts of Blogs and Diggs by quantitative 
analysis of the empirical datasets, the agent-based modeling has the advantage in that the emotional processes 
are studied at each agent (representing user) directly. Moreover, variety of social experiments can be devised and 
performed on the agents, thus avoiding the ethical issues related with the real users. Despite its mathematical 
complexity, our agent-based model obviously represents simplified reality both in respect to the dimensionality 
of emotions and the agent properties. For instance, in the science of emotion it is known that social aspects of the 
emotions such as "guilt", "pride", "shame" are different compared to "anger", "depression", "joy", and others. 
The psychological theory also suggests that personality profile determines how certain emotion is expressed. The 
theoretical modeling along the lines of our agent-based model will benefit from current developments in the sci- 
ence of affective computing, which aims for quantitative measures behind the psychology theory and for extracting 
different dimensions of emotions [22l- The agent properties can also include more realistic personal differences 
and influence of the real-world processes on users decisions and rhythm of their stepping into the virtual world. In 
the present model such real-world processes are implicitly taken into account through the parameters — the driving 
signal {p{t)} and the delay-time distribution P(Af ), which are involved in generating the respective empirical data 
of Diggs. This is a step beyond the first approximation, which we hope is opening the way for further research 
towards realistic modeling of the emotion dynamics on the Web. 



5 Figure Legends and Supporting Information 

Figure l:Patterns of activity differ for user and post nodes. Example of temporal patterns of (a) user actions 
and (b) activity at posts, obtained from the original dataset of discussion-driven Diggs (ddDiggs). Indexes are 
ordered by the user (post) first appearance in the dataset, while time is given in minutes. 

Figure 2:Time-series with circadian cycles and fractal features, (bottom) Time-series of the number of new 
users (red-dark) and the number of all active users (green-pale) per time bin of 5min derived from the ddDiggs 
dataset; (top) Power spectra of these time series as indicated (shifted vertically for better vision). Daily and weekly 
cycles can be easily noticed on both plots. 

Figure 3: Two-dimensional maps of the emotion variables of an isolated user-node. Maps for arousal (left) 
and valence (right) for four different values of the fields are shown. The fixed lmeX{t +1) ~X{t) is indicated. 

Figure 4:Parameters of the model as inferred from the empirical data of ddDiggs. (a) Distribution of g — the 
fraction of new posts per user, relative to all posts on which that user was active, averaged over all users in the 
dataset. (b) Probability fi that a user looks at a post which is older than the specified time window 7b time bins, 
averaged over all users and plotted against Tq. (c) Distribution of the time-delay At between two consecutive user 
actions, averaged over all users in the dataset. (d) Distribution of the life-time of posts tp, averaged over all posts 
in the dataset (log-binned data). In Figs, (c) and (d) the time axis is given in the number of time bins, each time 
bin corresponds to = 5 minutes of real time. 

Figure 5:Valence and arousal of each agent linked on the network fluctuate in time. Two examples of the 
valence and the arousal are shown against time for two agents (users) located in different areas of the bipartite 
network, resulting in different activity patterns: a very active agent (left) and a sporadically active agent (right). 
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Figure 6: Time series simulated in the model of interacting emotional agents on bipartite network. The 

number of all comments per time step (cyan) and the number of comments with positive (red) and negative 
(black) valence per time step (a), and the double-logarithmic plot of their power spectra (c). The number of active 
posts (indigo) and the number of active agents (magenta) per time step, and the charge of all emotional comments 
(blue), are shown in panel (b), and their corresponding power spectra, panel (d). Straight lines indicate slopes 
0={1, 1.33, 0}. For clear display, the power-spectra are logaritmically binned. 

Figure 7: Emergent bipartite network of the emotional agents and the posts exhibits strong inhomogeneity. 

Shown is a part of the network structure in the vicinity of three popular posts. 

Figure 8: Some topology measures of the bipartite network emerging in the dynamics of emotional agents. 

(a) The degree distributions of the user-partition (0-+) ^nd the post-partition (□,x). Fitting lines explained in 
the Legend, (b) Assortativity measures: The average degree of the posts linked to the user node of a given degree 
versus user degree (0-+)' ™d the average user degree linked to the post of a given degree, plotted against post 
degree (□,x). Empty symbols are for the simulation time 16384 steps, while the crosses indicate the respective 
results from runs with 25000 time steps. 

Figure 9: Spectral analysis of the emergent network reveals community structure. For the agent-projected 
network (a) the eigenvalue spectrum, and (b) the scatter-plot of three eigenvectors belonging to the lowest nonzero 
eigenvalues, indicate five communities Gjt, ^ = 1 , 2 • • • 5, related to five branches in the scatter-plot, from top to 
bottom. Note that each point in the scatter-plot represents a unique node on the network. 

Figure 10: Active communities grow in correlation with the excess of negative charge. Time series of the 
number of comments by the agents belonging to a given community (a), and the charge of these comments 

(b) , computed for each of five communities identified in Fig. [8] 

Figure 11: Activity patterns of the communities projected onto 2-dimensional space of emotions. Circum- 
plex map of the emotional states of the agents belonging to four communities identified on the emergent network: 
Gi-top left, G2-top right, Gs-bottom left, G4 -bottom right. Color map indicates occupancy of a given state, 
normalized relative to the number of comments in each community. 

Figure SI: Prevalence of negative comments on popular Diggs. Shown are the time-series and their power-spectra 
of the number of comments N±{t) carrying positive emotion (red) and negative emotion (black), and the time- 
series of "charge" of the emotional comments, Q{t) ^ N+{t) — A^_(f) (cyan). The Figure shows that comments 
are correlated in time, leading to 1 /v-type of power spectra and that the negative comments prevail. 

Video S2: User interests shift daily towards new posts. The movie shows the evolution of activity of one com- 
munity on the weighted bipartite network of discussion driven Digg stories during 90 days. In Ref. fl5l . we 
have identified such communities in the network constructed from the subset of the discussion-driven Diggs and 
keeping only very active users (with more than 100 comments). The network is then analyzed by the eigenvalue 
spectral methods ||39ll and three communities are identified fT5\. For the movie, we selected nodes belonging to 
one of these communities, g2, which consists of 236 users and posts, and then determined weighted bipartite 
subnetworks of it for each consecutive day. Thus, the weight of the link corresponds to the number of comments 
written by the user on the post on a given day, while the color of the link indicates overall emotional contents 
of these comments (black-negative, white-neutral and red-positive). The networks are then visualized using the 
program Pajek. Every frame corresponds to the window of one day in real time. The frames are combined using 
Avidemux program package. The movie shows that on each day different posts were in the focus. 

Figure S3: Some topology measures of the bipartite network derived from empirical data on Diggs. For com- 
parison of the network topology, the dataset of the popular discussion-driven Diggs, described in section [TTl is 
mapped onto a weighted bipartite network, with the weights of links representing the number of comments of a 
user to a post. The degree distribution of user-nodes and post-nodes of that network are computed and shown in 
Fig. S3(a), while the assortativity measures — the average degree of the post nodes linked to the user of a given 
degree, and vice versa, the average degree of the user nodes linked to the post of a given degree, are shown in 
Fig. S3(b). For completeness, shown are also the respective quantities computed from whole dataset of all Diggs 
(including the posts with normal popularity), indicated in the Legend. 

Figure S4: Community structure of post-projected network obtained from the emotional agents dynamics. Shown 
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is the scatter-plot of the eigenvectors belonging to three lowest eigenvalues of the post-projected weighted bipartite 
network, emerging in the simulations of the emotional agents dynamics after 4000 time steps. The network of size 
is reduced to A^p = 1 156 posts by taking the posts with strength Ip > 50, as relevant for the community formation. 

Figure S5: When the bloggers never sleep. Simulations results obtained for the system of the emotional agents 
is driven by adding a constant number of users p = 6 per time step: Time-series of the number of comments and 
charge (a) and their power-spectra (b). Degree distributions of user-nodes and post-nodes, P{qu) and P{qp), are 
shown in panel (c). Scatter-plot of the eigenvectors related with the user-projected network {Nu = 4418 users with 
strength (.u > 10 considered), indicating the community structure (d). 
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Figure 12: S3 
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Figure 13: S4 
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